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(57) Abstract 

Method of detecting a watermark embedded in a signal (S). in which a plurality of frames ({Fi. F2. ... ) of the signal (S) is combined 
to a detection set (Dj) for one detection event. Accoiding to the invention, the reliability of watennark detecUon »s enhanced by using 
non-consecutive frames ({Fi. Fw, ... ) to fomi the detection set (Dj). The invention also relates to an apparatus (2) for recording and/or 
playback of a signal, and to a system for broadcast monitoring, comprising such a watermark detector (24). 
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Watermark detection. 
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The invention relates to a method and a device for detecting a watermark 
embedded in a signal in which method a plurality of frames of the signal is combined to a 
detection set for one detection event. 

The invention also relates to an apparatus for recording and/or playback of a 
5 signal and to a system for monitoring a broadcast signal. 

Watermarks are imperceptible or hardly perceptible marks embedded in a signal 
such as audio, still pictures, animations, video, etc. They are embedded in the signal in such a 
way that they are difficult to remove. A watermark may comprise additional information such 

1 0 as, for example, iiiformation about the source or the copyright status of documents or 
audiovisual programs. Watermarks allow tracing of piracy and support the protection of 
intellectual property. 

A known method of detecting a watermark comprises a correlation step for 
correlating the signal with locally stored watermarks. The result of such a correlation is 

1 5 subsequently evaluated. When the correlation of the signal with a given watermark is, for 
example, larger than a given threshold value, this watermark may be considered to be the 
watermark embedded in the signal. A correlation with many different watermarks may also be 
performed, with the watermark having the highest correlation being considered as the 
watermark that has been used. 

20 Since the signal may be subjected to various signal operations such as MPEG 

compression, DA/AD conversion, NTSC/PAL conversion, as well as cropping and scaling, a 
detection based on single frames is often unreliable in the sense that it leads to many missed 
detections. The article "On the Reliability of Detecting Electronic Watermarks in Digital 
Images" by A.A.C. Kalker, G.F.C, Depovere and J.P.M. Linnartz, published in the 

25 Proceedings ofEusipco 1998 (Rhodes, Greece) gives a plausible explanation for the fact that 
the combination of a plurality of frames, for example n, to a detection set enhances the 
reliability of the detection process by a factor of Vfz. However, this applies only to n 
uncorrelated frames. It is common practice to combine consecutive frames. However, these are 
generally highly correlated. 
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It is an object of the invention to enhance the reliability of detecting a 

watermark. 

To this end, the invention is characterized in that, for forming the detection set, 
at least one frame is used which does not immediately follow an other frame in the signal, 
which other frame is also used for the detection set. The use of frames for a detection set other 
than only consecutive frames has the advantage that it enhances the reliability of detecting a 
watermark. The reason is that frames which are mutually ftirther apart are generally less 
correlated. When using less correlated frames, the enhancement of the reliabihty as applies to 
uncorrelated frames, namely the reUability of the detection process increasing by a factor of 

is better approximated. The use of frames, not all of which occur consecutively in the 
signal, is referred to as interleaving. 

In an embodiment, the method is characterized in that a predetermined number 
of frames is used for the detection set, which frames are present in the signal at a mutually 
predetermined distance corresponding to an integral number of frames. This uniform mode of 
selecting frames for a detection set is a method which is easily applicable. 

In a further embodiment, the predetermined number of frames and the 
predetermined distance are chosen to be such that their greatest conunon divisor equals one. In 
this case, a uniform distribution of I, B and P frames across the various detection sets is 
achieved for an MPEG signal. 

The embodiment as defined in claim 4 is mainly important for an MPEG signal. 
Also in this case, a uniform distribution of I. B and P frames across the various detection sets 
is achieved for an MPEG signal. 

Further embodiments relate to an apparatus for recording and/or playback of a 
signal, comprising a device for detecting a watermark according to the invention, and to a 
system for monitoring a broadcast signal, comprising such a device. 

These and other aspects are apparent from and will be elucidated, by way of 
non-limitative example, with reference to the embodiment(s) described hereinafter. 

In the drawings: 

Fig. 1 shows an embodiment of a device according to the invention, 
Fig. 2 shows an embodiment of an apparatus comprising a device for detecting 
a watermark according to the invention. 



wo 99/45704 ^ PCT/IB99/00293 

Both Figures only show the elements of the device in so far as is necessary for 
understanding the invention. 

Fig. 1 shows a device 1 for detecting a watermark according to the invention, 
comprising means 10 for combining a plurality of frames {Fi, F2, ... } of a signal S to a 
detection set Dj for one detection event. 

According to the invention, at least one frame Fi+a for forming the detection set 
is used, which frame does not immediately follow another frame Fi, also used for the detection 
set, in the signal S. One or more intermediate frames are skipped in this way and used for, for 
example, another detection set. An advantage of using frames {Fi, Fr+d» — }, not all of which 
are consecutive, is that the frames are mutually less correlated, thus enhancing the reliability 
of detecting a watermark. 

Let it be assumed that a number ofn frames {Fj, Fm, ... } is used for a detection 
set Dj. If all consecutive frames {Fi, F2, ... } are used as is afready known, the first decision 
moment occurs after n frames. A consequence of using n frames (Fj, Fi+d» ... }, not all of which 
are consecutive, is that the decision occurs at a later moment. 

The combination of frames {Fj, Fnd, }, not all of which are consecutively 
present in the signal S, may of course be effected in many different ways. Two interleaving 
schemes are discussed by way of example. Within the scope of the invention, it is possible to 
use different schemes yielding comparable results. 

The first scheme to be discussed is uniform interleaving. In this form of 
interleaving, the shortest distance between two frames {Fj, Fi4<j, ...} used for the detection set 
Dj is equal to a constant value d in the signal, corresponding to an integral number of frames of 
the signal. This may be expressed as follows 

Ft e Dj (mod d) 

A total number of frames {Fi, F2, ... } equal to d times n is assigned to a number 
of detection sets Dj equal to d. As an example, the detection sets Dj are shown below with a 
number of frames n = 12 and a mutual distance = 5, 

D\ = {Fi, Fa, Fn, F16, F21, F26, F31, F36, F41, F46, F51, F56}, 
D2 = {F2, F7, F12, Fi7, F22, F27, F32, F37, F42, F47, F52, F57}, 
= {F3, Fs, Fi3. F18, F23, F28, F33, F38, F43, F48, F53, F58}, 
Da = {F4, F9, F|4, Fi9, F24, F29. F34, F39, F44, F49, F54, F59} , 
D5 = {F5, Fio, F\5f F20, F25, F30, F35, F40. F45, F50, F55, F60} 
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For the sake of clarity, this distribution is illustrated with reference to Table I. 
Table I shows the numbers of the frames {Fi, F2, ... } consecutively, as occur in the signal S 

1 2 3 4 5 6 7 S 9 10 11 12 

13 14 15 16 17 18 19 20 21 22 23 24 

25 26 27 2S 29 30 31 32 33 M 35 36 

37 38 39 40 41 42 43 44 45 46 47 48 

49 50 51 52 53 54 55 56 57 58 59 60 

5 Table I: Illustration of uniform interleaving, n = 12, = 5, 

The consecutive frames {Fi, F2, ... } of the signal S are shown as d series ofn 
frames. This does not need to be a physical separation of the frames {Fi, F2, ... } of the signal, 
although this is possible, but may also be considered to be a conceptual notion so as to 

10 elucidate the invention. The different fonts indicate that the frames {Fi, F2, ... } are used in 
different detection sets Dj. A detection set Dj comprises frames {Fi, Fi+d, ... } with the numbers 
indicated by the same font in Table I. With this form of interleaving, the first detection 
decision takes place after rf(«-l)+l frames. The process is repeated after dn frames. 

It is important for many applications that the watermark survives a compression 

1 5 such as, for example, MPEG. The different frames in an MPEG signal: I, B and P frames may 
behave differently as regards survival of a watermark. When using uniform interleaving, there 
may be a problem for some values ofn and d. For the purpose of illustration, an example is 
given with « = 12 which, in this case, is also a Group of Picture (GOP) size of an MPEG 
signal S. The distance between the frames is = 4. The distribution of the frames {Fi, Fi+d, } 

20 is shown in Table II. 

IBBPBBPBBPBB 
"1 2 3 4 5 6 7 A 9 10 JI 12 
13 14 15 16 17 18 J9 20 21 22 23 24 
25 26 27 28 29 30 31 21 33 34 35 M 
37 38 39 40 41 42 43 44 45 46 47 48 



Table II: Illustration of uniform interieaving, /i = 12, £/= 4 at MPEG, 
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As is evident from Table II, the different detection sets Dj comprise a non- 
unifomi distribution of MPEG frames. For example, all I frames occur in one detection set Dj. 
Solutions within the scope of the invention are possible for a more uniform distribution of I, B 
and P frames across the detection sets Dj. 
5 When using imiform interleaving, it is, for example, possible to choose n and d 

in such a way that their greatest common divisor equals one. This is, for example, the case for 
n-l2 and = 5, as already discussed hereinbefore. See Table III. 

IBBPBBPBBPBB 
i 2 3 4 5 6 7 8 9 10 11 12 
13 14 15 16 17 18 19 20 21 22 23 24 
25 26 27 28 29 30 31 32 33 M 35 36 
37 3 8 39 40 41 42 4 3 44 45 46 47 48 
49 50 51 52 53 54 55 56 57 58 59 60 

Table HI: Illustration of uniform interleaving, /? = 12, c? = 5 at MPEG. 

10 

It can clearly be seen that each detection set Dj comprises a frame from each 
column. This means that each type of frame occurs in each detection set Dj. 

It is further possible, for example, to make the uniform interval more flexible. 
This may be effected, for example, by means of diagonal interleaving. With n being an 
15 integral number of times a GOP size of an MPEG signal S, a uniform distribution of I, B and P 
frames is achieved with the expression for diagonal interleaving, which may be defined as 
follows: 

Fi € Z), <=> / = 1 + {/-l)rt +m(/i+l) (mod dn) voor m e {0, 1,..., nA } 

20 

For K = 12 and = 5, the following detection sets Dj are obtained. 



D\ = {Fu ^6, F\\y ^14, Fi9, FiAj Fn, Fyi, F40, Fsz, Fs%}, 
Di = {F5. Fio, Fi3, F18, F23, Fiey F3U F36, F44, F47, F52, F57}, 
25 D2 = {F4, F9, Fi7. F22, F25. F30, F35, F38, F43, F48, F51, F56} . 

Da = {F3, Fg, F16, F21, F29, F34, F37, F42, F47, F50, F55, F60}. 
Ds = {F2, F7, F12, Fi5, F20, F28, F33, F41, F46, Fa9, F54, F59} 
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The distribution of the frames (Fj, Fi+d, ... } is shown in Table IV. 
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Table IV: Illustration of diagonal interleaving, /j = 12, = 5. 

5 As ab-eady explained hereinbefore, the consecutive frames (Ft, F2, ... } of the 

signal S are shown as d series oin frames. The number of frames per series is equal to the 
number of frames in a detection set, namely «. In this case, this number is fiirther equal to an 
integral number of times the GOP size of an MPEG signal S. It is evident from Table IV that 
the position in the signal S of a frame Y\ used in a detection set Dj shifts in each subsequent 

1 0 series by one position with respect to the start of the series. With respect to this frame Fi, 

which is here referred to as reference frame for the sake of simplicity, the other frames Fi+<i for 
the same detection set Dj are present in the same series at a distance which is equal to an 
integral number of times the number of series ± It may be noted that the average and most 
frequently occurring distance between the frames {Fi, Fj+a, } in a detection set Dj equals d = 

1 5 5, alternated by other distances such as 3 and 8 in this example. 

Fig. 2 shows an apparatus, for example a DVD player 2, comprising means 21 
for reading a DVD disc so as to obtain a DVD signal S. The DVD signal S is an MPEG bit 
stream which is stored on the DVD disc. The DVD signal S is applied fix)m an output 23 of the 
DVD player 2 via a switch 22. The output 23 is connected to an external MPEG decoder and a 

20 picture display device (not shown). For, for example, copyright protection it is assumed that 
the DVD player 2 cannot play video signals S with a predetermined watermark, unless other 
conditions are met which are not important for the invention. Watermarked signals S may be 
played back only when, for example, the disc comprises a given * Vobble" key. To be able to 
detect a watermark, the DVD player 2 comprises a watermark detector 24 which is adapted as 

25 described with reference to Fig. 1 . The watermark detector 24 receives the stored signal S and 
controls the switch 22 in dependence upon the detected watermark. 



wo 99/45704 ^ PCT/IB99/00293 

The invention may be further used, inter aha, in a system for broadcast 
monitoring. Broadcasts may be coded in advance with a watermark identifier. Monitoring 
television broadcasts with a system comprising a watermark detector 24 provides a mechanism 
for intellectual property protection and statistic data collection. 
5 It should be noted that the above-mentioned embodiments illustrate rather than 

limit the invention. Those skilled in the art will be able to conceive alternative embodiments 
without departing from the scope of the appended claims. 

Reference numerals between parentheses in the claims have been incorporated 
to elucidate these claims and should not be construed as limiting the claim. The word 
10 "comprising", and derivatives thereof do not exclude the existence of elements or steps other 
than those defined in a claim. The invention may be implemented by separate elements and by 
a correctly programmed computer. In the claims relating to a product, in which several means 
are mentioned, several of these means may be implemented in one and the same piece of 
hardware. 

15 In summary, a method of detecting a watermark embedded in a signal S is 

described, in which a plurality of frames {Fi, F2, ... } of the signal S is combined to a detection 
set Dj for one detection event. According to the invention, the reliability of watermark 
detection is enhanced by using non-consecutive frames {Fj, Fj+d, ... ) to form a detection set 
Dj. The invention also relates to an apparatus for recording and/or playback of a signal, and to 

20 a system for broadcast monitoring, comprising such a watermark detector. 



1 
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CLAIMS: 



1 . A method of detecting a watermark embedded in a signal (S), in which method 
a plurality of frames {Fi, F2, ... } of the signal (S) is combined to a detection set (Dj) for one 
detection event, 

characterized in that, for forming the detection set (Dj), at least one frame (Fj+d) 
5 is used which does not inmiediately follow an other frame (Fi) in the signal (S), which other 
frame (Fi) is also used for the detection set (Dj). 

2. A method as claimed in claim 1 , 

characterized in that a predetennined number of frames ({Fi, Fi+d, ... }) is used 
10 for the detection set (Dj), which frames ({Fi, Fj+a, ... }) are present in the signal (S) at a 

mutually predetermined distance corresponding to an integral number of frames ({Fi,, F2, ... }). 

3 . A method as claimed in claim 2, 

characterized in that the predetermined number of frames ({Fi, Fj+d, ... }) and 
1 5 the predetennined distance are chosen to be such that their greatest common divisor equals 
one. 



4. A method as claimed in claim I , 

characterized in that a predetermined number of frames ({Fi, Fi+<j, ... }) is used 
20 for each detection set (Dj ) which number of frames ({Fj, Fnd, ... }) corresponds to an integral 
number of times a GOP size of an MPEG signal (S), 

consecutive frames ({Fi, F2, }) in the signal (S) being divided into a plurality 
of series of a number of consecutive frames ({Fi, F2, ... }), which number of frames ({Fi,, F2, 
... }), equals the number of frames ({Fi, Fi+a, ... }) in the detection set (Dj), 
25 and a reference frame (Fi) is chosen from each series, the reference frames (Fj) 

in consecutive series being shifted by one position, calculated from the start of the series, 

and the reference frame (Fi) as well as frames (Fj+d) present at a distance from 
the reference frame (Fi) corresponding to an integral number of times the number of series are 
used for the detection set (Dj). 
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5. A device (1) for detecting a watermark embedded in a signal (S), the device (1) 
comprising: 

means for receiving the signal (S); and means (10) for combining a plurality of 
frames {Fi, F2, ... } of the signal (S) to a detection set (Dj) for one detection event, 

characterized in that the means (10) are adapted to use at least one frame (Fj+j) 
which does not immediately follow another frame (Fi), also used for the detection set (Dj), in 
the signal. 

6. An apparatus (2) for recording and/or playback of a signal (S), comprising 
means (21) for receiving the signal (S); 

characterized in that the apparatus (2) further comprises a device (24) as 
claimed in claim 5 for detecting a watermark embedded in the signal (S). 

7. A system for monitoring a broadcast signal (S), comprising means for receiving 
the signal (S); 

characterized in that the system fiirther comprises a device (24) as claimed in 
claim 5 for detecting a watermark embedded in the broadcast signal (S). 



1 
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S = {Fi.F2,...} 
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Dj = {Fi.Fi+d....} 



FIG.1 




